SELECT A DOCUMENT



(REFERENCE NUMERALS: 354, 352, 350)



SELECT CHARACTER SEQUENCES THAT WILL SEPARATE WORDS.
DEFAULT: WHITE SPACE ONLY (SPACE, TAB, END-OF-LINE)
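
The word-separation step above can be sketched as follows. This is a minimal illustration, not the method of the source: the function name `split_words`, the constant `DEFAULT_SEPARATORS`, and the inclusion of carriage return as an end-of-line character are assumptions.

```python
import re

# Assumed default: white space only (space, tab, end-of-line).
DEFAULT_SEPARATORS = [" ", "\t", "\n", "\r"]


def split_words(text, separators=None):
    """Split text on any of the given separator sequences, dropping empty items."""
    seps = DEFAULT_SEPARATORS if separators is None else list(separators)
    if not seps:
        # No separators configured: the whole text is a single item.
        return [text] if text else []
    # Try longer separators first so multi-character sequences are not
    # broken up by shorter ones that happen to be their prefix.
    ordered = sorted(seps, key=len, reverse=True)
    pattern = "|".join(re.escape(s) for s in ordered)
    return [w for w in re.split(pattern, text) if w]
```

Calling `split_words("a b\tc\nd")` uses the white-space default, while `split_words("a--b c", ["--"])` splits only on the user-selected sequence `--`.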



(REFERENCE NUMERALS: 358, 356)



0 



RETAIN OR ELIMINATE PUNCTUATION CHARACTERS (PERIOD, COMMA, COLON, SEMI-COLON, ETC.).
DEFAULT: RETAIN ALL
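
One way to picture the retain-or-eliminate choice is the sketch below. It assumes each punctuation character is treated as its own item alongside words; the function name `tokenize` and the flag `retain_punctuation` are hypothetical.

```python
import re

# A word is a run of word characters; any other non-space character
# is treated as a single punctuation item (an assumption of this sketch).
_ITEM = re.compile(r"\w+|[^\w\s]")
_WORD = re.compile(r"\w+")


def tokenize(text, retain_punctuation=True):
    """Return words and, when retained, punctuation characters in text order."""
    items = _ITEM.findall(text)
    if retain_punctuation:  # default: retain all punctuation
        return items
    return [item for item in items if _WORD.fullmatch(item)]
```

With the default, `tokenize("Hello, world.")` keeps the comma and the period as separate items; with `retain_punctuation=False` only the two words remain.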
(REFERENCE NUMERAL: 360)



SET REGULAR EXPRESSIONS THAT WILL CHARACTERIZE NUMBERS.
DEFAULT: INTEGERS, FLOATS AND DATES EMBEDDED IN TEXT ARE CONSIDERED NUMBERS.
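
The idea of characterizing numbers with regular expressions can be illustrated as below. The patterns themselves are assumptions chosen for the example (in particular the ISO-style date format); the source does not give the actual expressions.

```python
import re

# Hypothetical patterns, checked in order from most to least specific.
NUMBER_PATTERNS = [
    ("date", re.compile(r"\d{4}-\d{2}-\d{2}")),
    ("float", re.compile(r"[+-]?\d+\.\d+")),
    ("integer", re.compile(r"[+-]?\d+")),
]


def classify_number(token):
    """Return the numeric class of token, or None if it is not a number."""
    for name, pattern in NUMBER_PATTERNS:
        if pattern.fullmatch(token):
            return name
    return None
```

Ordering matters here: a date such as `2001-09-11` is tested before the integer pattern, and `fullmatch` ensures a token is classified only when the whole token fits the pattern.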
SET RANGE 1, 2, 3
DEFAULT = 1
SET CASE BEHAVIOR
DEFAULT: CONVERT CHARACTERS TO LOWER CASE.
SET OFFSET AND FACTOR FOR EACH NUMERIC CLASS (INTEGER, FLOAT, DATE, ...)
DEFAULT: OFFSET = 0, FACTOR = 1
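
The flowchart does not state how the offset and factor are combined with a number. The sketch below assumes a linear mapping, `value * factor + offset`, purely for illustration; under that assumption the stated defaults (offset 0, factor 1) leave every number unchanged. The class name `NumericScaling` and the dictionary `scalings` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class NumericScaling:
    """Per-class normalization settings; the defaults are the identity."""
    offset: float = 0
    factor: float = 1

    def apply(self, value):
        # Assumed combination rule; the source does not specify the formula.
        return value * self.factor + self.offset


# One setting per numeric class, each starting at the defaults.
scalings = {
    "integer": NumericScaling(),
    "float": NumericScaling(),
    "date": NumericScaling(),
}
```

Keeping one setting per numeric class lets integers, floats and dates be normalized independently, as the block text suggests.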
CONVERT DOCUMENT TO CHARACTER SEQUENCE
SP



GO TO FIRST WORD OR PUNCTUATION




CONVERT NUMBER INTO A SEQUENCE OF WORDS WN FOR FINGERPRINTING
SP



GO TO "NUMBER NORMALIZATION"
MARK THE POSITION AND LENGTH OF W (OR EACH ITEM WN) FOR LATER FINGERPRINTING
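
Marking the position and length of each item can be sketched as follows. The item definition (a run of word characters, or a single punctuation character) and the function name `mark_items` are assumptions; the fingerprinting step itself is not shown because the source does not describe it here.

```python
import re

# Assumed item definition: a word, or one punctuation character.
_ITEM = re.compile(r"\w+|[^\w\s]")


def mark_items(text):
    """Return a (position, length) pair for every word or punctuation character."""
    return [(m.start(), m.end() - m.start()) for m in _ITEM.finditer(text)]
```

Recording offsets rather than copies of the items means a later pass can slice the original character sequence directly, for example `text[pos:pos + length]`.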
GO TO THE NEXT WORD OR PUNCTUATION CHARACTER W
"Replacement Sheet"
START
CREATE A REGION "A" THAT ENCOMPASSES AT LEAST ONE DOCUMENT, ON ONE OR MORE NETS (cf. BLOCK 312 OF MAIN FLOWCHART)
MAP REGION "A" TO A DOCUMENT SET BY SELECTING THE UNION OR INTERSECTION OF DOCUMENTS ON DIFFERENT NETS.
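
The mapping of a region to a document set can be illustrated with plain set operations. The data layout (a dictionary from net name to the document identifiers the region covers on that net) and the function name `map_region_to_documents` are assumptions made for this sketch.

```python
def map_region_to_documents(docs_per_net, mode="union"):
    """Combine the documents a region covers on each net into one document set.

    docs_per_net: dict mapping a net name to an iterable of document ids.
    mode: "union" keeps documents found on any net,
          "intersection" keeps only documents found on every net.
    """
    sets = [set(docs) for docs in docs_per_net.values()]
    if not sets:
        return set()
    if mode == "union":
        return set().union(*sets)
    if mode == "intersection":
        return set.intersection(*sets)
    raise ValueError("mode must be 'union' or 'intersection'")
```

For a region covering `{"d1", "d2"}` on one net and `{"d2", "d3"}` on another, the union yields all three documents while the intersection yields only `d2`.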
CREATE A REGION "B" THAT ENCOMPASSES AT LEAST ONE DOCUMENT, ON ONE OR MORE NETS (cf. BLOCK 312 OF MAIN FLOWCHART)
TO FIG. 16B



